Supplemental material for: Efficient coordinate-descent for orthogonal matrices through Givens rotations
نویسنده
چکیده
Definition 1. Riemannian gradient The Riemannian gradient ∇f(U) of f at point U ∈ Od is the matrix UΩ, where Ω ∈ Skew(d), Ωji = −Ωij = ∇ijf(U), 1 ≤ i < j ≤ d is the directional derivative as defined in Eq. 1 of the main text, and Ωii = 0. The norm of the Riemannian gradient ||∇f(U)|| = Tr(∇f(U)∇f(U) ) = ||Ω||fro. Definition 2. A point U∗ ∈ Od is asymptotically stable with respect to Algorithm 1 if it has neighborhood V such that all sequences generated by Algorithm 1 with starting point U0 ∈ V converge to U∗. Theorem 1. Convergence to local optimum (1) The sequence of iterates Ut of Algorithm 1 satisfies: limt→∞ ||∇f(Ut)|| = 0. This means that the accumulation points of the sequence {Ut}t=1 are critical points of f . (2) Assume the critical points of f are isolated. Let U∗ be a critical point of f . Then U∗ is a local minimum of f if and only if it is asymptotically stable with regard to the sequence generated by Algorithm 1.
منابع مشابه
Supplemental material of Coordinate-descent for learning orthogonal matrices through Givens rotations
Theorem 1. Convergence to local optimum (a) The sequence of iterates Ut of Algorithm 4 satisfies: limt→∞ ||∇f(Ut)|| = 0. This means that the accumulation points of the sequence {Ut}t=1 are critical points of f . (b) Assume the critical points of f are isolated. Let U∗ be a critical point of f . Then U∗ is a local minimum of f if and only if it is asymptotically stable with regard to the sequenc...
متن کاملCoordinate-descent for learning orthogonal matrices through Givens rotations
Optimizing over the set of orthogonal matrices is a central component in problems like sparsePCA or tensor decomposition. Unfortunately, such optimization is hard since simple operations on orthogonal matrices easily break orthogonality, and correcting orthogonality usually costs a large amount of computation. Here we propose a framework for optimizing orthogonal matrices, that is the parallel ...
متن کاملEfficient coordinate-descent for orthogonal matrices through Givens rotations
Optimizing over the set of orthogonal matrices is a central component in problems like sparse-PCA or tensor decomposition. Unfortunately, such optimization is hard since simple operations on orthogonal matrices easily break orthogonality, and correcting orthogonality usually costs a large amount of computation. Here we propose a framework for optimizing orthogonal matrices, that is the parallel...
متن کاملNORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET Specifying Gaussian Markov Random Fields with Incomplete Orthogonal Factorization using Givens Rotations
In this paper an approach for finding a sparse incomplete Cholesky factor through an incomplete orthogonal factorization with Givens rotations is discussed and applied to Gaussian Markov random fields (GMRFs). The incomplete Cholesky factor obtained from the incomplete orthogonal factorization is usually sparser than the commonly used Cholesky factor obtained through the standard Cholesky facto...
متن کاملEfficient implementations of pipelined CORDIC based IIR digital filters using fast orthonormal μ-rotations
CORDIC based IIR digital lters are orthogonal lters whose internal computations consist of orthogonal transformations. These lters possess desirable properties for VLSI implementations such as regularity, local connection, low sensitivity to nite word-length implementation, and elimination of limit cycles. Recently, ne-grain pipelined CORDIC based IIR digital lter architectures which can perfor...
متن کامل